This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do hot report the images to the 
Image Problem Mailbox. 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCX 

INTERNATIONAL APPUCATION PUBUSHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 : 

C12N 1/lS, 15/10, C12P 9/00 // (C12N 
1/15, C12R 1:66) (C12N 1/15, C12R 
1;77) 



Al 



(11) Internattonal Publication Number: WO 00/50567 

(43) Internatioiial Publication Date: 3 1 August 2000 (3 1 .08.00) 



(21) InternaUonal AppUcation Number: PCT/DKOO/00063 

(22) Internattonal Filing Date: 17 Febniary 2000 (17.02.00) 



(30) Priority Data: 

PA 1999 00253 



24 Ffebniary 1999 (24.02.99) DK 



(71) Applicant: NOVO NORDISK A/S [DK/DK]; Enzyme Busi- 

ness Patents, Novo Alle, DK-2880 Bagsvaerd (DK). 

(72) Inventors: BORCHERT, Torbcn, Vedel; Vordingborggade 6a» 

31, DK-2100 Copenhagen 0 (DK). CHRISTIANSEN. 
Lars (deceased). VIND, Jespen Bagsvsrdvej 115, DK-2800 
Lyngby (DK). 



(81) Designated States: AE, AL, AM, AT. AU, AZ, BA, BB, BG, 
BR. BY, CA, CH, CN, CR, CU. CZ, DE, DK, DM, EE, 
ES, FI, OB. GD. GE. GH. GM, HR. HU, ID. IL, IN, IS, JP, 
KE, KG. KP. KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA, 
MD, MG, MK, MN, MW, MX, NO, NZ, PL. PT, RO. RU, 
SD, SE. SG. SI, SK, SL, TJ. TM, TR. TT. TZ, UA. UG, 
UZ, VN, YU, ZA, ZW, ARIPO patent (GH, GM, KE. LS. 
MW, SD. SL, SZ, TZ, UG, ZW). Eurasian patent (AM, AZ. 
BY, KG, KZ, MD. RU, TJ, TM). European patent (AT. BE. 
CH, CY. DE, DK, ES, FI, FR. GB, GR, IE, IT, LU, MC, 
NL, PT, SE), OAPI patent (BF, BJ, CF. CO. CI, CM. OA, 
GN, GW, ML. MR. NE, SN. TD. TG). 



Published 

With international search report. 



(54) Title: FUNGAL CELLS WITH I NACTIVATED DNA MISMATCH REPAIR SYSTEM 

Mismatch repair inactive Mismatch repair active 





duplication 



daplioatiea 




Diversity : Two 



Divaraity: One 



(57) Abstract 



A process for making DNA libraries in filamentous fiingai cells using a novel cloned gene involved in the mismatch repair system of 
filamentous fungal cells. 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 





Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Stovalria 


AT 


Austria 


FR 


France 


LU 


Luxembouig 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azcftaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia atxl Herzegovina 


GE 


Gcoigia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


'Hiricmenistan 


BF 


Buikina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Twkey 


BG 


Bulgaria 


HU 


Hungry 


ML 


Mali 


TT 


Trini(bd and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


TsFBel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


J^an 


NE 


Niger 


VN 


Vict Nam 


CC 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


C6tc d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






ON 


China 


KR 


Republic of Korea 


PT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


SafaitLuda 


RU 


Russian Federation 






DE 


Germany 


U 


Liechtenstein 


SD 


Sudan 






DK 


Denmaric 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Sin^ipoie 







wo 00/50567 PCT/DKOO/00063 



FUNGAL CELLS WITH INACTIVATED DNA MISMATCH REPAIR SYSTEM 



5 FIELD OF INVENTION 

A process for making DNA libraries in filamentous fungal 
cells using a novel cloned gene involved in the mismatch repair 
system of filamentous fungal cells. 

10 BACKGROUND OF THE INVENTION 

The mismatch repair system is a system within cells which 
recognises mismatches in newly synthesised duplex DNA 
. sequences. 

The mismatch repair system then either corrects the 
15 mismatches which are seen as errors by e.g. using the 

methylated ""old" strain as template or alternatively it may 
mediate degradation of the duplex DNA sequences which comprise 
the mismatches. 

Independently on the precise mechanism the end result 
20 will be that the ""mismatch repair system'' will limit the 
""diversity'' within a cell, diversity being represented as 
duplex DNA sequences which comprise mismatches. 

For example a duplex DNA sequence which comprises a 
single mismatch represents a diversity of two different DNA 
25 sequences within the cell. If the mismatch repair system 
corrects the mismatch there will only be a diversity of one 
within the cell. 

Alternatively, if the mismatch repair system mediates the 
degradation of such a duplex DNA sequence the diversity will be 
30 lost. See figure 1 for a graphic illustration on how the 
mismatch repair system may work within a cell. 

Consequently, if duplex DNA sequences comprising 
mismatches represent a DNA library of interest, then the 
diversity of this library may be limited when transformed 
35 (placed) into cells with an active mismatch repair system. 

The art provides a solution to this problem by making 
cells wherein the mismatch system is inactive. 
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EP 449923 describes bacterial cells wherein the mismatch 
system is inactivated. 

WO 97/37011 describes yeast cells wherein the mismatch 

system is inactivated. See the working examples of this 
document . 

WO 97/05268 describes mice cells wherein the mismatch 

system is inactivated. See the working examples of this 
document . 



10 SUMMARY OF THE INVENTION: 

The problem to be solved by the present invention is to 
provide an improved strategy for making DNA libraries in 
filamentous fungal cells. A filamentous fungal cell population 
comprising such a DNA library may then be used to select a 
15 polypeptide of interest. Also polynucleotide sequences with 
particular properties might be selected, such as promoters, 
terminators and other regulatory elements with changed/ improved 
properties . 

The solution is based on that the present inventors have 
20 cloned a NOVEL gene involved in the mismatch repair system of a 
filamentous fungal cell. Further, this gene is the first gene 
cloned which is involved in the mismatch repair system of a 
filamentous fungal cell. 

By inactivating this gene in a filamentous cell it is 
25 possible to obtain a filamentous cell which is deficient in its 
mismatch repair system and which is highly useful for preparing 
DNA libraries in filamentous fungal cells. 

The gene comprises a very characterising DNA sequence 
encoding the polypeptide sequence shown in positions 683-758 of 
30 SEQ ID NO 2. 

This DNA has been used to clone the full length gene 
encoding the polypeptide sequence shown in positions 1-940 of 
SEQ ID NO 2. See working examples herein (vide infra). 

The gene cloned as described in working examples herein 
35 is a gene cloned from an Aspergillus oryzae filamentous fungal 
cell . 

However, based on the novel sequence information provided 
herein it is routine work for the skilled person to clone 
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similar homologoias genes from other filamentous fungal cells by 
e.g. standard hybridisation or PGR technology, preferably by 
using the DNA sequence encoding the polypeptide sequence shown 
in positions 683-758 of SEQ ID NO 2 as a basis for making a 
5 hybridisation probe or PGR primers. 

Accordingly, in a first aspect the present invention 
relates to a filamentous fungal cell, wherein a gene involved 
in the mismatch repair system has been inactivated and in which 
the gene involved in the mismatch repair system comprises: 
10 (a) a DNA sequence encoding the polypeptide sequence 

shown in positions 683-758 of SEQ ID NO 2; or 
(b) a DNA sequence encoding a polypeptide sequence which 
is at least 70% identical to the polypeptide sequence 
shown in positions 683-758 of SEQ ID NO 2; 

15 and 

in a second aspect the present invention relates to a 
filamentous fungal cell, wherein a gene involved in the 
mismatch repair system has been inactivated and in which the 
gene involved in the mismatch repair system comprises: 
20 (a) a DNA sequence encoding the polypeptide sequence 

shown in positions 1-940 of SEQ ID NO 2; or 
(b) a DNA sequence encoding a polypeptide sequence which 
is at least 70% identical to the polypeptide sequence 
shown in positions 1-940 of SEQ ID NO 2 . 

25 

As stated above a filamentous fungal cell of the first or 
second aspect of the invention is very suitable for making a 
DNA library of interest in filamentous fungal cells. 

Accordingly, in a third aspect the present invention 
30 relates to a process for preparing a filamentous fungal cell 
population wherein individual cells in the population comprise 
individually different DNA sequences of interest representing a 
DNA library of interest comprising following steps: 

(a) placing individually different DNA sequences of 
35 interest in a filamentous fungal cell population 

comprising a filamentous fungal cell of the first or 
second aspect of the invention; and 

(b) growing the population of (a) for a period of time 
allowing an individual DNA sequence of interest in the 
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population to be duplicated at least once under 
conditions wherein the mismatch repair system gene of 
the first or second aspect of the invention has been 
inactivated. 

5 

One of the advantages of allowing an individual mismatch 
repair inactivated filamentous fungal cell duplicated DNA of 
interest at least once as descried under step (b) of the third 
aspect is illustrated in figure 1. As can be seen in figure 1 

10 the process of the third aspect using a filamentous fungal mis- 
match repair inactivated cell as described herein allows prepa- 
ration of a DNA library wherein eventual hetero duplex DNA mis- 
matches are not corrected. This gives a DNA library with a 
higher diversity as compared to a DNA library made in a fila- 

15 mentous fungal cell NOT having an inactivated mismatch repair 
system (see figure 1) . Duplication of DNA sequence of interest 
means that the two strands are replicated such that two sepa- 
rate sets of double stranded DNA are generated, each being 
based on a separate one of the two original strands. 

20 A filamentous fungal cell population wherein individual 

cells in the population comprise a DNA library of interest as 
described above may be used to select a polypeptide of 
interest . 

Accordingly, in a fourth aspect the present invention 
25 relates to a process for production of a polypeptide of 
interest comprising the steps of the third aspect and wherein 
the DNA sequences of interest encode a polypeptide of interest 
and which further comprises following step: 

(c) selecting from the resultant population of 
30 filamentous fungal cells of step (b) of the third 

aspect a desired polypeptide of interest. > 
An advantage of the process of the fourth aspect may be 
that the polypeptide of interest is selected from a filamentous 
fungal cell expressing the polypeptide. Consequently, it is 
35 directly known that the polypeptide can be expressed from a 
filamentous fungal cell, which is usefully if it is 
subsequently required to produce the polypeptide in large scale 
in a filamentous fungal cell. This may be of particular 
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interest when the DNA library encodes polypeptides of interest 
which are derived from filamentous fungal cells, since it is 
known that filamentous fungal polypeptides preferably are 
produced in industrial relevant high yields in filamentous 
5 fungal cells. 

This is contrary to a similar selection process using 
e.g. a yeast cell. Here the only thing known is that the 
selected polypeptide is capable of being expressed in yeast and 
later expression a filamentous fungal cell might give problems/ 
10 esp.ecially if high yields are- required. 

DEFINITIONS : 

Following section provides definitions of technical 
features in above-mentioned aspects of the invention. 

15 The term "" a gene" denotes herein a gene (a DNA sequence) 

will is capable of being expressed into a polypeptide within 
said cell. Accordingly, said gene sequence will be defined as 
an open reading frame starting from a start codon (normally 
""ATG", ""GTG", or ""TTG" ) and ending at a stop codon 

20 (normally ^^TAA", TAG'' or ^^TGA"). 

In order to express said gene there must be elements, as 
known in the art, in connection with the gene, necessary for 
expression of the gene within the cell. Such standard elements 
may include a promoter, a ribosomal binding site, a termination 

25 sequence, and may be others elements as known in the art. 

The term "" mismatch repair system " shall herein be 
understood according to the art, as a system within cells which 
recognises mismatches in duplex DNA sequences. See e.g. WO 
97/37011, page 1, line 21-28) 

30 The mismatch repair system then either corrects the 

mismatches which are seen as errors by e.g. using the 

methylated ""old" strain as template or alternatively it may 
mediate degradation of the duplex DNA sequences which comprise 
the mismatches, 

35 Independently on the precise mechanism the end result 

will be that the ""mismatch repair system" will limit the 
""diversity" within the cell represented by such duplex DNA 
. sequences which comprise mismatches. 
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For example a duplex DNA sequence which comprises a 
single mismatch represents a diversity of two different DNA 
sequences within the cell. If the mismatch repair system 
corrects the mismatch their will only be a diversity of one 
5 within the cell. Alternatively, if the mismatch repair system 
mediates the degradation of such a duplex DNA sequence this 
diversity will be lost, 

A polypeptide encoded by a gene involved in the mismatch 
repair system recognises a mismatch by a mechanism involving 
10 binding to the mismatch. 

Accordingly, a suitable assay to test whether or not a 
filamentous fungal cell as described herein is inactivated in 
its mismatch repair system is to use a ^^gel shift assay" or 
alternatively termed a ^^gel retardation assay'*. This is a 
15 standard assay used in the art. See WO 97/05268, page 16,17 and 
25. 

The principle in such an assay is that cell extracts are 
prepared of both (a) a filamentous fungal cell wherein the 
gene, as described herein, involved in the mismatch repair 

20 system is inactivated; and (b) the corresponding filamentous 
fungal cell wherein the gene is NOT inactivated. These extracts 
are then bound/mixed to oligonucleotides containing the base- 
pair mismatched G:T; G:A; G:G; A:C, and an extrahelical TG 
dinucleotide and run on a nondenaturing gel. 

25 If the gel shift assay demonstrates that the control 

filamentous fungal cell wherein the gene is NOT inactivated 
comprises any protein (s) which binds to any of above mentioned 
oligonucleotides and these binding protein (s) are NOT seen in 
the filamentous fungal cell wherein the gene, as described 

30 herein, involved in the mismatch repair system is inactivated 
then it is a confirmation that the mismatch repair system in 
the latter is inactivated. 

A detailed description of a suitable gel shift assay is 
provided in working example 1 herein. 

35 The sequence identity in relation to the terms 

"" a DNA sequence encoding a polypeptide sequence which is 
at least 70% identical to the polypeptide sequence shown in 

positions 683-758 of SEP ID NO 2 ' ' and 
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"" a DNA sequence encoding a polypeptide sequence which is 
at least 70% identical to the polypeptide sequence shown in 

positions 1-940 of SEP ID NO 2 '^ ; 

is determined as the degree of identity between two 
5 sequences indicating a derivation of the first sequence from 
the second. The identity may suitably be determined by means of 
computer programs known in the art such as GAP provided in the 
GCG program package (Program Manual for the Wisconsin Package, 
Version 8, August 1994, Genetics Computer Group, 575 Science 

10 Drive, Madison, Wisconsin, USA 53711) (Needleman, S-B. and 
Wunsch, CD., (1970), Journal of Molecular Biology, 48, 443- 
453) . Using GAP with the following settings for polypeptide 
sequence comparison: GAP creation penalty of 3.0 and GAP 
extension penalty of 0.1, the polypeptide encoded by an 

15 analogous DNA sequence of the invention exhibits a degree of 
idisntity preferably of at least 70%, more preferably at least 
80%, more preferably at least 90%, more preferably at least 
95%, and especially at least 97% with amino acid sequence shown 
in positions 683-758 of SEQ ID NO 2 , according to the first 

20 aspect of the invention; or with amino acid sequence shown in 
positions 1-940 of SEQ ID NO 2, according to the second aspect 
of the invention. 

The term "" DNA library '' denotes herein a library of at 
least two different DNA sequences. For many practical purposes 

25 the library is must bigger. Accordingly, the DNA library 
preferably comprises at least 1000 different DNA sequences, 
more preferably at least 10000 different DNA sequences, and 
even more preferably at least 100000 different DNA sequences. 

The term "^ placing individually different DNA sequences of 

30 interest in a filamentous fungal cell population " in relation 
to step (a) in the process of the third aspect of the invention 
shall herein be understood broadly in the sense that it is NOT 
identical DNA sequences of interest which are placed in the 
filamentous fungal cell population. In the present context, 

35 relating to a process for making a DNA library using a mismatch 
repair deficient cell, the term should preferably denotes a 
situation wherein a cell within the filamentous fungal cell 
population comprises at least two different DNA sequences of 



wo 00/50567 PCT/DKOO/00063 - 

8 

interest which are so partially homologous that they are 
capable of hybridising/recombining to each other within the 
cell. It is within the skilled persons general knowledge to 
determine how partially homologous such sequences have to be in 
5 order to obtain said recombination within the cell. 

A practical example may be that single stranded 
oligonucleotide sequences partially homologous to chromosomal 
DNA sequence are placed within the cell or duplex DNA sequences 
comprising mismatches (e.g. comprised within a vector) are 
10 placed within the cell. See ' below for further description of 
such examples. 

The specific experimental way of placing these DNA 
sequences within a filamentous cell may be done according to 
any of the many suitable techniques, such as transformation 
15 techniques . 

The term "" growing the population of (a) for a period of 
time allowing an individual DNA seguence of interest in the 
population to be duiplicated at least once under conditions 
wherein the mismatch repair system gene has been inactivated ' ' 

20 according to step (b) of the third aspect of the invention 
denotes that after an individual cell has duplicated itself at 
least once then the mismatch repair system may be activated 
again without loosing the advantage of the process. The 
technical reason for this is illustrated in figure 1. In this 

25 example a duplex DNA sequence comprising a single mismatch is 
placed in filamentous cell. After the cell has been duplicated 
once under conditions wherein the mismatch repair system gene 
has been inactivated the two individually different single 
stranded DNA sequences within the duplex DNA have individually 

30 been duplicated providing two different duplex sequences, one 
in each duplicated cell, without any mismatches. Accordingly, 
since such a cell does NOT comprise duplex DNA sequences of 
interest having mismatches then there is no technical need to 
maintaining the mismatch repair system inactivated. 

35 In sections below are described preferred embodiments of 

the invention by way of examples only. 



DRAWINGS : 
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Figure 1 : 

This figure illustrates an example wherein a duplex DNA 
sequence comprising a single mismatch is placed in filamentous 
cell. After the cell has been duplicated once under conditions 
5 wherein the mismatch repair system gene has been inactivated 
the two individually different single stranded DNA sequences 
within the duplex DNA have individually been duplicated 
providing two different duplex sequences, one in each 
duplicated cell, without any mismatches. On the contrary, in a 
10 cell wherein the mismatch repair system is active, a mismatch 
within a duplex is corrected. 

Figure 2 : 

This figure shows three partial Aspergillus oryzae 

15 polypeptide sequences: "msh2 * Ao-coll0/13/l5; derived from 
cloned PGR fragments. The three partial polypeptide sequences 
are aligned with two other partial polypeptide sequences of 
known mismatch repair proteins: a human mismatch repair 
protein, msh2-human.p; and a fungal Saccharomyces cerevisiae 

20 mismatch repair protein, S.c. msh2 . The underlined sequences in 
the figure derive from the construction of the PGR fragments. 

Figure 3 : 

This figure shows an alignment of the proposed 
25 polypeptide sequence of the putative Aspergillus oryzae 

mismatch repair protein (Ao.MSH2) with the polypeptide 
sequences of three known mismatch repair proteins from human 
( msh2 -human. p) , mouse (msh2-mus .p) , and yeast (S.c. msh2) . 

30 

DETAILED DESCRIPTION OF THE INVENTION 

A filcunentous fungal cell, as described herein, wherein a gene^ 
as described herein, involved in the mismatch repair system 
35 has been inactivated. 



Inactivation of a gene involved in the mismatch repair system: 
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The NOVEL gene, as described herein, involved in the 
mismatch repair system may be inactivated by any of the 
numerous known techniques known to the skilled person. 

An embodiment of the invention relates to a filamentous 
5 fungal cell as described herein, wherein the gene involved in 
the mismatch repair is defective. 

Numerous methods are known to the skilled person to make 
a gene defective when the DNA sequence is KNOWN. These methods 
includes deleting part of the DNA sequence of the gener- 
ic introducing frame -shift mutations by deleting or inserting 
nucleotides; introducing stop codons etc. 

A preferred embodiment of the invention relates to a 
filamentous fungal cell as described herein, wherein the gene 
involved in the mismatch repair has been inactivated 
15 transitorily. 

Similarly to above, a number of methods are known to the 
skilled person for doing this, including insertion of a 
regulable promoter upstream of the gene or e.g. permanently 
deleting part of the gene on the chromosome followed by 
20 inserting a vector (e.g. a plasmid) into the cell which 
comprises the gene. The plasmid may then comprise a regulable 
promoter up-steam of the gene or it may be that the plasmid can 
be removed from the cell when the mismatch repair system shall 
be inactivated transitorily and then re-inserted into the cell 
25 when the mismatch repair system shall be re-activated. 

It is within the skilled persons general knowledge to 
choose the appropriate strategy for a specific technical 
purpose . 

A preferred way to make a filamentous fungal cell which 
30 is capable of transitorily inactivate the mismatch repair 
system as described herein is first to permanently inactive the 
mismatch repair gene described herein on the chromosome of the 
cell followed by inserting a plasmid into the cell which 
comprises the gene, wherein the plasmid is characterised by 
35 that it comprises a suitable replication initiating sequence 
and a suitable selectable marker. 

Preferably the suitable replication initiating sequence 
is AMAl (Gems, D., et al. (1991, Gene 98:61-67). 
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A more detailed description of suitable replication 
initiating sequences and suitable selectable markers is 
provided immediately below and in working example 4 herein is 
provided an example of this strategy using a plasmid comprising 
5 AMAl as replication initiating sequence and AmdS as selectable 
marker . 

Replication initiating sequences 

As used herein, the term ''fungal replication initiating 

10 sequence'' is defined as a nucleic acid sequence which is capa- 
ble of supporting autonomous replication of an extrachromosomal 
molecule, e.g., a plasmid or a DNA vector, in a fungal host 
cell, normally without structural rearrangement of the plasmid 
or integration into the host cell genome. The replication ini- 

15 tiating sequence may be of any origin as long as it is capable 
of mediating replication initiating activity in a fungal cell. 
Preferably, the replication initiating sequence is obtained 
from a filamentous fungal cell, more preferably a strain of As- 
pergillus, Fusarium or Alternaria, and even more preferably, a 

20 strain of A, nldulans ^ A. oryzae, A. niger, F. oxysporum or Al- 
ternaria al tena ta . 

A replication initiating sequence may be identified by 
methods well-known in the art. For instance, the sequence may 
be identified among genomic fragments derived from the organism 

25 in question as a sequence capable of sustaining autonomous rep- 
lication in yeast, (Ballance and Turner, Gene, 36 (1985), 321- 
331), an indication of a capability of autonomous replication 
in filamentous fungal cells. The replication initiating activ- 
ity in fungi of a given sequence may also be determined by 

30 transforming fungi with contemplated plasmid replicators and 
selecting for colonies having an irregular morphology, indicat- 
ing loss of a sectorial plasmid which in turn would lead to 
lack of growth on selective medium when selecting for a gene 
found on the plasmid (Gems et al. Gene, 98 (1991) 61-67) . AMAl 

35 was isolated in this way. An alternative way to isolate a rep- 
lication initiating sequence is to isolate natural occurring 
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plasmids (e.g. as disclosed by Tsuge et al., Genetics 146 
(1997) 111-120 for Alternaria aternata) . 

Examples of replication initiating sequences include, but 
are not limited to, the ANSI and AMAl sequences of Aspergillus 
5 nidulans, e.g., as described, respectively, by Cullen, D., et 
al. (1987, Nucleic Acids Res. 15:9163-9175) and Gems, D., et 
al. (1991, Gene 98:61-67) . 

The term ^'replication initiating activity" is used herein 
in its conventional meaning, .i.e. to indicate that the sequence 
10 is capable of supporting autonomous replication of an ex- 
trachromosomal molecule, such as a plasmid or a DNA vector in a 
fungal cell. 

The term ''without structural rearrangement of the plasmid" 
is used herein to mean that no part of the plasmid is deleted 
15 or inserted into another part of the plasmid, nor is any host 
genomic DNA inserted into the plasmid. 

Filamentous fungal selective marker 

The term ''selective pressure" is defined herein as culturing 

20 a filamentous fungal cell, containing a DNA vector containing a 
fungal selective marker gene operably linked to a polynucleo- 
tide sequence of interest, in the presence of an effective 
amount or the absence of an appropriate selective agent. The 
effective amount of the selective agent is defined herein as an 

25 amount sufficient for allowing the selection of cells contain- 
ing the selection marker from cells which do not contain the 
selection marker. 

In a preferred embodiment, the fungal selective marker is 
selected from the group of genes which encodes a product capa- 

30 ble of providing resistance to biocide or viral toxicity, re- 
sistance to heavy metal toxicity, or prototrophy to auxotrophs. 

In a more preferred embodiment, the prototrophy is obtained 
from an enzyme selected from the group of metabolic pathways 
consisting of nucleotide synthesis, cofactor synthesis, amino 

35 acid synthesis, acetamide metabolism, proline metabolism, sul- 
fate metabolism, and nitrate metabolism. 
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In an even more preferred embodiment, in the methods of the 
present invention the fungal selective marker is a gene se- 
lected from the group consisting of argB (ornithine carbamoyl- 
transferase) , amdS (acetamidase) , bar (phosphinothricin acetyl- 
5 transferase), hemA ( 5-aminolevulinate synthase), hemB (porpho- 
bilinogen synthase), hygB (hygromycin phosphotransferase), niaD 
(nitrate reductase) , prn (proline permease) , pyrG (orotidine- 
5' -phosphate decarboxylase), pyroA, riboB, sC (sulfate adenyl- 
transf erase) , and trpC (anthranilate synthase). 

10 The fungal cell is cultivated in a suitable medium and under 
suitable conditions for screening or selecting for transfer- 
mants harbouring the variant polynucleotide sequence of inter- 
est having or encoding the desired characteristic. The cultiva- 
tion may be performed in accordance with methods well-known in 

15 the art for screening of polynucleotide variant libraries. 

The filamentous fungal cell 

The filamentous fungal cell as described herein includes 
all filamentous forms of the subdivision Eumycota and Oomycota. 

20 The filamentous fungi are characterised by a mycelial wall 
composed of chitin, cellulose, glucan, chitosan, mannan, and 
other complex polysaccharides. Vegetative growth is by hyphal 
elongation and carbon catabolism is obligately aerobic. In 
contrast, vegetative growth by yeasts such as Saccharomyces 

25 cerevisiae is by budding of a unicellular thallus and carbon 
catabolism may be fermentative. In a preferred embodiment, the 
filamentous fungal cell is a cell of a species of, but is not 
limited to, Acremonium, Aspergillus, Fusarium, Humicola, Mucor, 
Myceliophthora, Neurospora, Penicillium, Scytalidium, 

30 Thielavia, Tolypocladium, and Trichoderma, 

Examples of filamentous fungal cells of use in the present 
invention include an Aspergillus cell, an Acremonium cell, a 
Fusarium cell, a Humicola cell, a Mucor cell, a Myceliophthora 
cell, a Neurospora cell, a Penicillium cell, a Thielavia cell, 

35 a Tolypocladium cell, and a Trichoderma cell. 
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More specifically, the filamentous fungal cell is an Asper- 
gillus awamori, Aspergillus foetidus, Aspergillus japonicus, 
Aspergillus nidulans, Aspergillus niger, or Aspergillus oryzae 
cell; 

5 a Fusarium bactridioides, Fusarium cerealis, Fusarium 
crookwellense, Fusarium culmorum, Fusarium graminearum, 
Fusarium graminum, Fusarium heterosporum, Fusarium negundi, 
Fusarium oxysporum, Fusarium reticulatumr Fusarium roseum, 
Fusarium sambucinum, Fusarium sarcochroum, Fusarium 

10 sporotricioides, Fusarium sulphureum, Fusarium toruios ujn, 
Fusarium trichothecioides, Fusarium venenatum cell or a 
Fusarium venenatum cell (Nirenberg sp. nov; a Humicola 
insolens cell or a Humicola lanuginosa cell; a Mucor miehei 
cell; a Myceliophthora thermophila cell; a Neurospora crassa 

15 cell; a Penicillium purpurogenum cell; a Thielavia terrestris 
cell; or a Trichoderma harzianum, Trichoderma koningii, 
Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma 
viride cell. 

20 A process for preparing a filamentous fiingal cell population 
comprising a DNA library according to the third aspect of the 
invention: 

Placing individually different DNA secruences of interest in a 
25 filamentous fungal cell population according to step (a) of the 

process of the third aspect of the invention: 

As state above, the specific experimental way of placing 

these DNA sequences within a filamentous cell may be done 

according to any of the many suitable techniques, such as 
30 transformation techniques. See the general fungal textbook 

"^Fungal Genetic" (1996, ISBN 0-8247-9544-X) for a further 

description of such standard techniques. 

A practical example may be that single stranded 

oligonucleotide sequences partially homologous to chromosomal 
35 DNA sequence are placed within the cell. See Calissano et al. 
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(Fungal genetic newsletter 43:15-16 (1995) for further 
description of this. 

Another example may be that duplex DNA sequences 
comprising mismatches (e.g. comprised within a vector as shown 
5 in figure 1) are placed within the cell. 

In an preferred embodiment the different DNA sequences of 
interest is comprised in a plasmid wherein the plasmid is 
characterised by that it comprises a suitable replication 
initiating sequence and a suitable selectable marker as 
10 described above . 

Preferably the suitable replication initiating sequence 
is AMAl (Gems, D,, et al . (1991, Gene 98:61-67). 

Growincf the population of step (a) for a period of time 
15 allowing an individual DNA sequence of interest in the 
population to be duplicated at least once under conditions 
wherein the mismatch repair system gene, as describe herein has 
been inactivated, according to step (b) of the third aspect of 
the invention. 

20 Growing of the population may be done in any of the 

numerous suitable known media for growing filamentous fungal 
cells. It is within the skilled persons general knowledge to 
choose such a suitable media. 

As explained above an individual cell in the population 
25 must be allowed to be duplicated at least once under conditions 
wherein the mismatch repair system gene, as described herein 
has been inactivated. 

Said cells may of course be allowed to be duplicated more 

than once under conditions wherein the mismatch repair system 

30 gene has been inactivated. 

Since inactivation of the mismatch repair system normally 
will cause accumulation of mutations on the chromosomal DNA 
within the cell and thereby maybe make lethal mutations to the 
cell the actual preferred number of duplication cycles as 
35 described above will depend on how fast such potential lethal 
mutations arise. 

It is within the skilled persons general knowledge to 
determine how many of duplication cycles it preferred. 
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Due to these potential lethal mutations it is preferred 
that the mismatch repair system under step (b) has been 
inactivated transitorily. 

After suitable cycles of duplication according to step 
5 (b) of the third aspect the transitorily inactivated mismatch 
repair system the mismatch repair system is then re-activated 
in order to avoid these lethal mutations in the filamentous 
fungal cell as such. The strategy for this transitorily 
inactivation may be any of the strategies described above. 
10 Another strategy to limit introduction of mutations on 

the chromosome, is to transitorily stop the chromosomal 
replication while replicating the extra -chromosomal element 
under mismatch repair deficient conditions. This can be 
achieved by introducing mutations in elements being solely 
15 necessary for the chromosomal replication. 

A preferred strategy is to use a filamentous, fungal cell 
wherein the gene involved in the mismatch repair system as 
described herein is permanently inactivated on the chromosome 
of the cell followed by inserting a plasmid into the cell which 
20 comprises the gene, wherein the plasmid is characterised by 
that it comprises a suitable replication initiating sequence 
and a suitable selectable marker. See above for a further 
explanation of this strategy. 

Preferably the suitable replication initiating sequence 
25 is AMAl (Gems, D., et al. (1991, Gene 98:61-67). 

A further embodiment relates to the process of the third 
aspect of the invention, wherein the mismatch repair system 
under step (b) is defective. 

In a further embodiment the invention relates to a 
30 process as described herein, wherein, under step (b) of the 
third aspect of the invention, there is an in vivo intergenic 

recombination of partially homologous DNA sequences of 

interest . 

Since the overall concept of the present invention is to 
35 provide a process involving inactivation of the mismatch system 
it is of course preferably that said partially homologous DNA 
sequences are capable of in vivo forming duplex DNA sequences 
comprising mismatches. 
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A process for production of a polypeptide of interest 
comprising the steps of the third aspect of the invention cuid 
wherein the DNA sequences of interest encode a polypeptide of 
5 interest, according to the fourth aspect of the invention; 

Selecting from the resultant population of filamentous fungal 
cells of step (b) of the third aspect a desired polypeptide of 
interest, according to step (c) of the fourth aspect. 

10 The desired polypeptide of interest may be any 

polypeptide comprising an desired technical feature, such as 
improved stability; a desired specific activity; a desired pH 
optimum; an improved wash performance in a detergent; etc. 

The specific strategy for selecting this desired 

15 polypeptide of interest may be any of the numerous selecting 
strategies known to the skilled person, such as plate screening 
assays, micro- titer plate based assays, etc. 

An embodiment of the invention relates to a process of 
20 the fourth aspect of the invention, which further comprises 
following steps: 

(d) an optionally step comprising modifying the amino 
acid sequence of the desired selected polypeptide of 
interest according to a particularly further specific 

25 need; 

(e) placing the DNA sequence encoding the polypeptide of 
interest of step (c) of the fourth aspect or the 
modified polypeptide of interest of step (d) into a 
filamentous fungal cell which is suitable for large 

30 scale production of the polypeptide of interest ; 

(f) cultivating the filamentous fungal cell of step (e) 
in a fermentor of at least 10000 m3 under conditions 
permitting expression of the polypeptide of interest; 
and 

35 (g) isolating the polypeptide of interest. 

This embodiment relates to an industrial very relevant 
process, wherein the selected polypeptide of interest is 
produced in large scale. 
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The optionally step (d) relates to a situation wherein 
e.g. the desired polypeptide of interest is selected in order 
to e.g. identify a polypeptide with improved wash performance 
in a detergent according to step (c) of the third aspect of the 
5 invention. This polypeptide while having improved wash 
performance in a detergent may not be sufficiently stable for a 
commercial product. Accordingly, it may be required to make 
some further amino acid substitutions in this selected 
polypeptide, such as e.g. suitable Proline substitutions on 
10 order to obtain sufficient stability to commercialising this 
polypeptide . 

A further embodiment relates to a process of the 
embodiment immediately above, wherein the filamentous fungal 
cell which is suitable for large-scale production of the 
15 polypeptide of interest of step (e) said embodiment is another 

filamentous fungal cell as compared to the filamentous fungal 
cell of step (a) of the third aspect of the invention. 

This embodiment relates to a situation wherein the 
filamentous fungal cell used to select the polypeptide of 
20 interest is different from the one which is used for large 

scale production. 

A further embodiment relates to a process as described 

herein, wherein the polypeptide of interest is a polypeptide 

derived from a filamentous fungal cell. 
25 The term derived from a filamentous fungal cell should be 

understood in the sense, that the information in the amino acid 

sequence of the polypeptide of interest is derived from a 

polypeptide obtained from a filamentous fungal cell. 

Consequently, it may be a variant of a wild- type 
30 filamentous fungal polypeptide and/or may be a polypeptide 

which is a result of a recombination/shuffling of two or more 

different filamentous fungal polypeptides. 

In an even further embodiment the invention relates to a 

process as described herein, wherein the polypeptide of inter- 
35 est is an enzyme, such as an amylase, a protease, a cellulase, 

a lipase, a xylanase; a phospholipase • 



EXAMPLES : 
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Materials 

Chemicals used as buffers and substrates were commercial 
products of at least reagent grade. 

5 

EXAMPLE 1: 

A gel ehift assay suitable for determining if a filamentous 
fungal cell as described herein is inactivated in the mismatch 

repair system: 

10 The principle in this gel shift assay is that cell 

extracts are prepared of both (a) a filamentous fungal cell 
wherein the gene, as described herein, involved in the mismatch 
repair system is inactivated; and (b) the corresponding 
filamentous fungal cell wherein the gene is NOT inactivated. 

15 These extracts are then bound/mixed to oligonucleotides 
containing the base-pair mismatched G:T; G:A; G:G; A:C, and an 
extrahelical TG dinucleotide and run on a nondenaturing gel. 

If the gel shift assay demonstrates that the control 
filamentous fungal cell wherein the gene is NOT inactivated 

20 comprises any protein (s) which binds to any of above mentioned 
oligonucleotides and these binding protein (s) is NOT seen in 
the filamentous fungal cell wherein the gene, as described 
herein, involved in the mismatch repair system is inactivated 
then it is a confirmation that the mismatch repair system in 

25 the latter is inactivated. 
Experimental procedure: 

Preparation of cell extracts are performed as described 
in Nagata et al . (Mol. Gen Genet (1993) 237:251-260; See 
Materials and Methods) . 

30 Annealing of oligonucleotides, binding of cell extracts 

to duplex oligonucleotides containing mismatched, and 
nondenaturing polyacrylamide gelelectrophoresis are performed 
essentially as described (Stephenson and Karran; Selective 
binding to DNA base pair mismatches by proteins from human 

35 cells; J. Biol. Chem. 264:2177-21182 (1989)). 

However, gelelectrophoresis is performed in TAE buffer 
rather than in TBE buffer. To obtain duplex oligonucleotides, 
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the oligonucleotide U is radiolabelled and annealed with any of 
the unlabelled oligonucleotides L-G.T, L-G.A, L-G.C, L-A,C, L- 
T.G., or L-HOM. Oligonucleotide sequences are derived from 
Aquilina et al. Proc. Natl. Acad. Sci. USA 91:8905-8909 (1994). 

5 

52 B'-GGGAAGCTGCCAGGCCCCAGTGTCAGCCTCCTATGCTC-S" (SEQ ID NO 

3) ; 

L-G,T: 5'-GAGCATAGGAGGCTGACATTGGGGCCTGGCAGCTTCCC-3'' (SEQ ID NO 

4) (resulting in a G.T mismatch) ; 

10 L-G,A. ; 5"-GAGCATAGGAGGCTGACAATGGGGCCTGGCAGCTTCCC-3' (SEQ ID NO 

5) (resulting in a G.A mismatch) ; . 

L-G.G. ; 5"-GAGCATAGGAGGCTGACAGTGGGGCCTGGCAGCTTCCC-3" (SEQ ID NO 

6) (resulting in a G.G mismatch) ; 

L-A.C. ; 5'-GAGCATAGGAGGCTGACACCGGGGCCTGGCAGCTTCCC-3' (SEQ ID NO 
15 7) (resulting in a A.C mismatch) ; 

L-TG; B'-GAGCATAGGAGGCTGACACTGTGGGGCCTGGCAGCTTCCC-B" (SEQ ID NO 

8) (resulting in an extrahelical TG dinucleotide) ; 
L-HOM; 5^-GAGCATAGGAGGCTGACACCGGGGCCTGGCAGCTTCCC-3' (SEQ ID NO 

9) (resulting in a homoduplex) . 

20 

In all assays, a twofold excess of unlabelled homoduplex 
competitor oligonucleotide is included. 

EXAMPLE 2: 

25 Cloning of a gene involved in the mismatch repair system of an 
Aspergillus oryzae cell> 

The gene cloned as described in this example is shown in 
SEQ ID NO 1 (DNA sequence) and SEQ ID NO 2 (the translated 
amino acid sequence) . 
30 Several sequences of mismatch repair proteins from 

various organisms are known, only three of these have been 
utilized in the following: S. cerevisiae (M84170) , H. sapiens 
(L47580) and mouse {U21011) . 

The numbers indicated are reference numbers from the 
35 public available GenBank database. 
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Based on the C- terminal homology between known mismatch 
repair proteins, a set of degenerate primers were designed, 
aiming at amplification of a partial sequence of the 
Aspergillus oryzae homolog: 

5 

Pr 117858 {SEQ ID NO 10) : P - GGCNCARATHGGNTGYTTYGTNCC 
Pr 117859 (SEQ ID NO 11) : P-GCCCANGCNARNCCRAANCC 

With chromosomal DNA from A. oryzae strain JaL142 (WO 
10 96/29391) as template, and above primers the following 50/xl PWO 
polymerase based PGR reaction was performed at eight MgSO^ 
concentrations (0.5 mM to 4.0 mM, as recommended by the 
manufacturer; Boehringer M.). 1 mM MgSO^ was found to be optimal 
and gave a discrete band of appr. 230 bp as would be expected 
15 if no introns were embodied in the sequence. 

PCR-cycle profile: [96°C/ 2 min - 30 cycles of (94*'C; 15s - 
SO^'C; 15s - 72°C; 30s) - 72''C,7 min - 4*^0; hold]. 

20 The 23 0bp PGR fragment was blunt end ligated into filled 

in BamHl site of pUC19. pUC19 was BamHl cleaved in presence of 
calf intestine alkaline phosphatase, followed by filling in 
the sticky ends by klenow polymerase and dNTP. Three 
individual plasmids harbouring the insert were isolated from 

25 E.coli XLl transf ormants of above ligation, and sequenced. 
Alignment of polypeptides derived by translation of the cloned 
PGR fragments, revealed a strong homolgy to known mismatch 
repair protein sequences (see Figure 2) . 

The underlined sequences of Figure 2 are sequences 

30 derived from the consensus PGR primers described above. 

The three Aspergillus sequences of Figure 2 are equal to 

the sequence shown in SEQ ID NO 2 from positions 683-758, 
except from position 685 which in the final cloned sequence is 
a Thr (T) in sted of an lie (I) as indicated above. This is due 
35 to the sequence in above mentioned consensus primers. 

The alignment shown in Figure 2 clearly demonstrates that 
the cloned fragment originates from an A. oryzae homologue of a 
mismatch repair protein. 
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In order to clone the entire gene, a radiolabeled probe 
of the cloned fragment was generated by PGR, using 0.5 mg 
pUC19 'msh2 ' -13 (see above) as template in a 100 ml reaction 
with Taq polymerase, 30 pmol pUC forward and reverse primers 
5 and 0.2 mM of dG- , dC-, dTTP and 0.2mM dATP + "P-dATP. The 
generated radiolabeled probe was liberated from pUC19 sequences 
by EcoRl- Hind3 digestion followed by gel purification of the 
resulting 293 bp fragment. 

The probe was hybridized to a membrane gridded cosmid 
10 library of genomic DNA from A, oryzae strain A1560 (the father 

of JaL142) (W096/29391) . A positive clone was identified on the 
filter when analyzed in a phosphoimager, and the clone was 
identified as X31A2. 

The ^31A2 cosmid DNA was propagated and used for southern 

15 analysis, using the same radiolabeled primer as above. An 
approximately 9 Kb Pst fragment, split by BstX (previously 
found in the cloned PGR fragment) into 5.8 and 3.2 kb fragments 
both lightening up with the probe, was identified and cloned 
into Pst cut pUG19, giving a plasmid named pUC19msh2P. The 

20 insert was sequentially sequenced, starting with primers 
pointing out from the previously determined sequence, followed 
by primers based on the sequences determined in the last run: 
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GCTCGAAACATCCAACATCC 

GCTGTGAATCACTTGCACC 

CTTCATAAACTGCGACAAATCATGC 

GGAGGAGCATCTTCGC 

GGAACTTGAAGACTTTACTTCATCC 

CCAGAAACTCGCTAACACC 

GTGCTTTGCGGACGC 

CAGGACAGTAGGACGC 

CGAGCGATGAACTCTGC 

GCGTTGGTGGATTATCC 

CGTTGCATCTATCATATACC 

GGTATATGATAGATGCAACGC 



The 3825 bp sequence hereby determined (SEQ ID NO 1) was 
translated in the frame previously determined in the PGR 
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fragment. The resulting protein (SEQ ID NO 2) called Ao.MSH2 
was aligned to the protein sequences of known mismatch repair 
proteins in Figure 3. From the alignment in shown in Figure 3 
the cloned and sequenced DNA clearly encompasses the coding 
5 sequence for a homolog of yeast, man and mouse mismatch repair 
proteins, with one intron in the N- terminal part. The position 
of the intron was deduced by the standard splice rules, and 
constitutes the only possibility. 

10 EXAMPLE 3: 

Disruption of the gene cloned in example 1 on the chromosome of 

an Aspergillus oryzae cell; 

For the disruption experiment the msh2 CDS was deleted 
from pUC19msh2P (see example 2) by PGR, introducing a Notl site 
15 instead. This was done by the primers: 

137208 (SEQ ID NO 24): 5* P - CCGCGTCTCCAACAAGATGAATGG 
13 7207 (SEQ ID NO 25) : 5' P - CCGCTTTCTCGGGGTCATAGC 

20 In a Pwo polymerase based PGR reaction with 2,5 mM MgSO^ 

and 50 pg pUC19msh2P (conditions according to the 
manufacturer) : . 

PCR-cycle profile: [SG^'C; 2min - 4 cycles of (94''C; 30s - 
25 52^C;30s - 72°C; 3min) - 25 cycles of {94*'C; 30s - SS^'G; 30s - 
72**G; 3min) - 72°C; lOmin] 

The resulting PGR product of appr. 8.9 Kb was isolated, 
ligated into pUG19, and transformed into E. coli XLl. From the 

30 resulting transf ormants pMsh2A was isolated, and the 
correctness of the new junction and its surroundings verified 
by sequencing [primer 138149 (SEQ ID NO 26) : 
CGTTTGGAGTTTAATGGTAAGC] . (Xll/pMsh2A: Lac3073). 

In this construct it the A. oryzae pyrG (WO 96/29391) is 

35 inserted into the Notl site. 

By using this construct the chromosomal gene is deleted 
in an Aspergillus oryzae cell according to standard techniques 
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known in the art for crossing in such a deleted gene on the 
chromosome by homologous recombination (Miller, B.L., et al*, 
1985 Mol. and Cell. Biol. 5:1714-1721). 

5 EXAMPLE 4: 

Construction of a plasmid comprising the mismatch repair gene 
shown in SEQ ID NO 1, the AMAl replication initiating sequence; 
and the AmdS selectcible marker: 

The plasmid constructed as described below is highly 

10 suitable for making a filamentous fungal cell wherein the 
mismatch repair system may be transitorily inactivated, wherein 
this plasmid may be inserted into a mismatch disrupted strain 
of example 3 when the mismatch repair system shall be activated 
and deleted from the strain when the mismatch repair system 

15 shall be inactivated. 

Disruption of the mismatch repair gene may cause the 
accumulation of new chromosomal mutations, thus such a strain 
might be genetically unstable. Consequently it was decided to 
perform the chromosomal disruption in a strain where mismatch 

20 repair gene was expressed from an extra chromosomal element 
readily lost when the Amismatch repair phenotype was wanted. 

The extra -chromosomal element was here a plasmid 
comprising AMAl as replication initiating sequence and AmdS as 
selectable marker, 

25 For this purpose the mismatch repair gene (SEQ ID NO 1) 

was cloned into an autonomously replicating construct 
harbouring one AMAl repeat . 

From pMT1505 (See example 5 below for description of 
pMT1505) the following fragments were isolated and ligated 

30 together: 

5.16 kb NotI- [Hind3] * + 3.515 kb [Sal]*-BamHl + 757bp 
BamHl- NotI 

[ ] * indicates that the site has been filled in by 
35 Klenow-polymerase and dNTP 

From this ligation reaction pMTlSOSDHS was isolated (LaC 
3212) , and the mismatch repair expression cassette was 
introduced as a BamHl - Muni fragment in the corresponding 
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sites in pMT1505DHS, resulting in the plasmid pAma-msh2 (LaC 
3216). 

Aspergillus oryzae JaL250 (see example 5) was transformed 

AmdS* with pAma-msh2, and the transf ormants cheked for the 
5 ability to lose the amdS character when unselected (50% of the 
transf ormants) , indicating the maintanance of this plasmid as 
extra chromosomal. (LaC3244 keep on acetamide + uridine). 



EXAMPLE 5: 

10 Construction of plasmid pMT1505 used in example 4: 
Plasmids 

pMT1505: constructed as described below in Example 5 

pHelpl: contains the pyrG gene from A. oryzae as a selective 

marker and the AMAl sequences which enable autonomous 
15 replication in A. nidulans as described by Gems, D., 

et al, (1991, Gene 98: 61-67) 
pToC68: as described in EP 0 531 372 (Novo Nordisk A/S) 



Strains 

20 JaL250: a derivative of Aspergillus oryzae A1560 in which the 
pyrG gene has been inactivated, as described in WO 
98/01470 

DH5a: an E. coli host cell purchased from GIBCO BRL (Life 

Technologies, Inc., Rockville MD) 

25 

pMT14 66 was constructed by inserting an Sphl/Narl fragment 
from pHelpl into pToC68. pMT1489 was constructed by digesting 
pMT1466 with SphI and StuI, then religating. pMTlSOO was con- 
structed by digesting pMT1489 with Aatll and Narl and ligating 

30 a linker. pMT1504 was constructed by digesting pMTlSOO with 
Nhel and religating. 

pMT1505 was constructed by inserting a 2.7 kb Xbal fragment 
containing the amdS encoding gene from A. nidulans genomic DNA 
(Corrick, CM., et al , 1987, Gene 53:63-71) into pMT1504 which 

35 had been cut with Nhel. 



V 
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EXAMPLE 6: 

Deletion of part of the mshll gene on the chromosome 

The plasmid p418MsHII (from lac3159) is cut with Sail and 
Xhol and treated with calf -intestinal phosphatase. In this man- 
ner part of the msHll gene is cut out. The large band (6400 bp) 
5 containing the vector and most of the msHII gene is isolated 
from a' 1% agarose gel. 

The plasmid pJal554 was constructed by ligating a 
Spel/SspBI cut fragment (5330 bp) from pS02 with a Asp718/Nhel 
cut fragment (316 bp) from pS02. Plasmid pJal554 is cut with 
Sail and a 2350 bp band which contains the pyrG gene is 
isolated on a 1% agarose gel. The 2350bp band with pyrG is 
ligated with the cut p418MsHII plasmid and transformed into E. 
coli. The right E, coll transformant is identified by 
restriction analysis and a plasmid preparation is made from 
this transformant . 

The plasmid thus prepared is cut with £coRI in order to 
linearize the plasmid before it is transformed into for example 
Aspergillus oryzae Jal250. Transf ormants are selected on mini- 
10 mal plates. 

A transformant where a double crossover event has taken 
place is identified by making an Aspergillus chromosomal DNA 
prep followed by a PGR screen for full-length mshll gene using 
appropriate primers. A Southern blot is made using chromosomal 
15 DNA which is randomly fragmented with appropriate enzymes as 
well as appropriate probes for the deleted msHll fragment 
(which is not there any longer) as well as a positive control 
probe . 

In order to determine an increased mutation frequency in 
20 the strain with an inactivated msHll gene, a screen for muta- 
tions in the niaD gene is made. This is done by growing the 
parent strain Aspergillus Jal250 and the msHll inactivated 
strain on plates. 

A spore-suspension is made and aliquotes of spores are 
25 plated on chlorate-containing plate as described by Unkles et 
al. (S. E. Unkles, E.C.Campbell. Y.M.J.T. de Ruite Jacobs, M. 
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Broekhuisen, J. A. Macro, D.Carrez, R. Contreras, C. A. M.J. J. van 
den Hondel J.R. Kinghorn. The development of a homologous 
transformation system for Aspergillus oryzae based on nitrate 
assimilation pathway: A convenient and general selection system 
5 for filamentous fungal transformation. Molecular and general 
genetics V:218 p 99-104 (1989)). 

The strain with no expression of the MsHII protein will 
have a higher rate of niaD mutations (more chlorate resistant 
clones), than the control strain. 

10 

EXAMPLE 7: 

Making In Vivo antisense msHIl RNA to inhibit translation of 
msHIl mRNA using the TAKA promoter to drive transcription of 
the Anti "Sense mRNA 

15 Anti-sense RNA expression is a well known way to down 

regulate expression of any gene in vivo (The design of An- 
tisense RNA , Georg Sczakiel , Antisense and nucleic acid drug 
development V. 7 P. 439-444 (1997)) 

A per fragment is made using the oligo's: 

20 000120j2 (SEQ ID NO 27): TCTGCGAATCGCTTGGATCCCGAACGCGACAACAC, 
000120 j 4 (SEQ ID NO 28): GAGCTCAGATCTCTTAGGTTCTGGACGAGAAGA, 
and pUC19msh2P as template. This PGR fragment contains the 5 
'end of the msHll gene including the presumed part of 5' msHll 
mRNA. Another PGR fragment is made using the oligo's: 

25 000120 j 3 (SEQ ID NO 29): GTTGTCGCGTTCGGGATCCAAGCGATTCGCAGAAG, 
12 98 -TAKA (SEQ ID NO 30) : GCAAGCGCGCGCAATACATGGTGTTTTGATCAT, 
and pENI1298 as template (PCT DK99/00552) . 

Both PGR reactions are done using PWO polymerase according to 
the manufacturers manual (Boehringer-Mannheim) . 

30 The PGR fragments are purified using the Qiagen PGR puri- 

fication kit (Qiagen) . The two PGR fragments are mixed and a 
third PGR reaction is done with primer 1298-TAKA and 000120j4. 
In this manner the two PGR fragments are assembled. 

The assembled PGR fragment is cut with BssHII and Bglll, 

35 and purified from a 1.5 % agarose gel and ligated with pENI1298 
which was cut with BssHII and Bgl II (purified from 1 % agarose 
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gel) . The ligation mixture is transformed into E, coli. A DNA- 
prep is made of each of the resulting E. coli transf ormants . 
The assembled PGR fragment is sequenced to confirm that no un- 
wanted mutations are introduced during the procedure- The cor- 
5 rect construct contains the TAKA promoter, which drives the 
transcription of the msHll anti-sense mRNA. 

The resulting plasmid is transformed into for example As- 
pergillus oryzae Jal250 along with pENI1298 as control, and 
transf ormant are selected on minimal plates. The resulting 

10 transf ormants are isolated on minimal plates and incubated at 
37°C until they sporulate. 

In order to determine an increased mutation frequency in 
the strain, where the translation of the msHll mRNA is impeded 
due to msHll anti-sense RNA expression, a screen for mutations 

15 in the niaD gene is made. A spore-suspension is prepared of the 
control transf ormants (pENil298) and of the jnsHII Anti-sense 
RNA transf ormants, and equal amounts of spores are plated on to 
chlorate-containing plate as described by Unkles et al. (S. 
E.Unkles, E.C.Campbell. Y.M.J.T. de Ruite Jacobs, M. Broekhui- 

20 sen, J. A. Macro, D.Carrez, R. Contreras, C. A. M.J.J, van den 
Hondel J.R. Kirighorn. The development of a homologous transfor- 
mation system for Aspergillus oryzae based on nitrate assimila- 
tion pathway: A convenient and general selection system for 
filamentous fungal transformation. Molecular and general genet- 

25 ics V:218 p 99-104 (1989)). 

The strain with no or low expression of the MsHII protein 
will have a higher rate of niaD mutations (more chlorate resis- 
tant clones) , than the control strain. 
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CLAIMS 

1. A filamentous fungal cell, wherein a gene involved in the 
mismatch repair system has been inactivated and in which the 
gene involved in the mismatch repair system comprises: 

5 (a) a DNA sequence encoding the polypeptide sequence 

shown in positions 683-758 of SEQ ID NO 2; or 
(b) a DNA sequence encoding a polypeptide sequence which 
is at least 70% identical to the polypeptide sequence 
shown in positions 683-758 of SEQ ID NO 2 . 

10 

2. A filamentous fungal cell, wherein a gene involved in the 
mismatch repair system has been inactivated and in which the 
gene involved in the mismatch repair system comprises: 

(a) a DNA sequence encoding the polypeptide sequence 
15 shown in positions 1-940 of SEQ ID NO 2; or 

(b) a DNA sequence encoding a polypeptide sequence which 
is at least 70% identical to the polypeptide sequence 
shown in positions 1-940 of SEQ ID NO 2. 

20 3. The filamentous fungal cell of claim 1 or 2 , wherein the 
gene involved in the mismatch repair is defective. 

4. The filamentous fungal cell of claim 1 or 2, wherein the 
gene involved in the mismatch repair has been inactivated 

25 transitorily. 

5. The filamentous fungal cell of any of the preceding claims, 
wherein the filamentous fungal cell is a strain of Fusarium or 
more preferably Aspergillus. 

30 

6. A process for preparing a filamentous fungal cell population 
wherein individual cells in the population comprise 
individually different DNA sequences of interest representing a 
DNA library of interest comprising following steps: 

35 (a) placing individually different DNA sequences of 

interest in a filamentous fungal cell population 
comprising a filamentous fungal cell of claim 1 or 2; 
and 
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(b) growing the population of (a) for a' period of time 
allowing an individual DNA sequence of interest in the 
population to be duplicated at least once under 
conditions wherein the mismatch repair system gene of 
5 claim 1 or 2 has been inactivated. 

7, The process of claim 6, wherein the mismatch repair system 
under step (b) is defective. 

10 8 . The process of claim 6 , wherein the mismatch repair system 
under step (b) has been inactivated transitorily. 

9. The process of any of claims 6-8, wherein the filamentous 
fungal cell is a strain of Fusarium or more preferably 

15 Aspergillus . 

10. The process of any of claims 6-9, wherein, under step (b) 
of claim 6, there is an in vivo intergenic recombination of 
partially homologous DNA sequences of interest. 

20 

11. A process for production of a polypeptide of interest 
comprising the steps of claim 6 and wherein the DNA sequences 
of interest encode a polypeptide of interest and which further 
comprises following step: 

25 (c) selecting from the resultant population of 

filamentous fungal cells of step (b) of claim 6 a 
desired polypeptide of interest. 

12. The process of claim 11, which further comprises following 
30 steps: 

(d) an optionally step comprising modifying the amino 
acid sequence of the desired selected polypeptide of 
interest according to a particularly further specific 
need; 

35 (e) placing the DNA sequence encoding the polypeptide of 

interest of step (c) of claim 11 or the modified 
polypeptide of interest of step (d) into a filamentous 
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fungal cell which is suitable for large scale 
production of the polypeptide of interest; 

(f) cultivating the filamentous fungal cell of step (e) 
in a fermentor of at least 10000 mB under conditions 

5 permitting expression of the polypeptide of interest; 

and 

(g) isolating the polypeptide of interest. 

13. The process of claim 12, wherein the filamentous fungal 
10 cell which is suitable for large scale production of the 
polypeptide of interest of step (e) of claim 12 is another 
filamentous fungal cell as compared to the filamentous fungal 
cell of step (a) of claim 6. 

15 14. The process of any of claims 11-13, wherein the polypeptide 
of interest is a polypeptide derived from a filamentous fungal 
cell. 

15. The process of any of claims 11-14, wherein the polypeptide 
20 of interest is an enzyme, such as an amylase, a protease, a 
cellulase, a lipase, a xylanase; a phospholipase . 



25 
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SEQUENCE LISTING 

<11G> Novo Ncrnisk A/5 

<120> A f ilaip.er.' r j£ fungal mismatch repair defective cell 

<130> Mismatch repair deficient fila, fungi 

<140> 
<141> 

<160> 30 

<170> Patent In Ver. 2.1 

<210> 1 
<211> 3823 
<212> DNA 

<213> Aspegilius oryzae 

<220> 
<221> CDS 

<222> (700) . . {723; 
<220> 

<221> intron 
<222> (724) . . (780) 

<220> 
<221> CDS 

<222> (781) . . (3579) 

<400> 1 



gtgggtagtg 


gcccaaaagc 


tactgtggct 


gcccagagga 


agccgtttcg 


cctagagatg 


60 


atcgtacaga 


acgtccagga 


ctcatcgaag 


aatcctattt 


cagagaaaga 


ggtggaaatc 


120 


tgcgtcgaag 


tgctggctcg 


gcccgacatt 


gctggacaat 


gggtcgattt 


cgtcaccgtg 


180 


aatcatatca 


aatcogrggt 


tctgaaatcc 


tccgcggata 


tcaacctcaa 


ggatatcggt 


240 


gcgaaggtgc 




Outcggcgag 


gacgagcctg 


cttcagcctc 


gaacccctaa 


300 


tcaagacttc 


tactcr wtaa 


tgtgtgtttt 


ggatgtttgg 


tgttgctggg 


ttggataccc 


360 


catctgtgga 


gtt tga;:aca 


catgacttta 


ttatcaccct 


tgtgtagcat 


ctgttagcgt 


420 


tgcatctatc 


atataccatt 


ttagctatta 


gagaatacat 


atcaatcatg 


atgaacatga 


480 


agtacttcag 


ttcctatgtg 


taggcgtttt 


ttggacctat 


cactttgtag 


agttctaacg 


540 


ggtgctatta 


ttggccaatg 


atacttcaat 


atatcgcgaa 


cgcgacaaca 


cgtgaccgcg 


600 


ttgccacgga 


gcgcgcctcc 


tggaattatc 


gaaatagatc 


gatcacggca 


gagctaatgg 


660 


tcagtcttcc 


attcatcttg 


ttggagacta 


actggcaag atg tct tct 
Met Ser Ser 


cgt cca 
Arg Pro 


714 



1 5 
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gaa ctt aag gtaagtaaac. aagacaaccg gtctctcgaa cattcaataa 763 
Glu Leu Lys 

caattaaccc tqtzzaq gtt gac gac gaa gtc ggc ttc att cgt ttt tac 813 
Val Asp Asp Glu Val Gly Phe lie Arg Phe Tyr 
10 15 

cgt tec etc gca gca aat age aac gat gaa act att egc gtt ttc gac 861 

Arg Ser Leu Ala Ala Asn Ser Asn Asp Glu Thr lie Arg Val Phe Asp 
20 25 30 35 

egc ggt gac tgg tac tet gcc eat ggc gee aaa gca gag ttc ate get 909 

Arg Gly Asp Trp Tyr Ser Ala His Gly Ala Lys Ala Glu Phe He Ala 
40 45 50 

egc act gtg tac aac ace acc tet ata etc egc aat eta ggt egc age -957 

Arg Thr Val Tyr Lys Thr Thr Ser He Leu Arg Asn Leu Gly Arg Ser 

5 5 60 65 

gac tea gga ggc ctz ecc tec gtc acc atg agt gtc acc gtc ttc cgt 1005 

Asp Ser Gly Gly Leu Pro Ser Val Thr Met Ser Val Thr Val Phe Arg 
70 75 80 

aac ttt etc egc gaa get etc ttc ega etc aac aag egc att gaa ate 1053 

Asn Phe Leu Arg Glu Ala Leu Phe Arg Leu Asn Lys Arg He Glu He 
85 90 95 

tgg ggc tea gtc gga aeg ggc aag ggt cat tgg aag ctg gta aag caa 1101 

Trp Gly Ser Val Gly Thr Gly Lys Gly His Trp Lys Leu Val Lys Gin 
100 105 110 115 

get age ccg gga aac etc caa gat gtg gaa gaa gag ttg ggc age gtt 1149 

Ala Ser Pro Gly Asn Leu Gin Asp Val Glu Glu Glu Leu Gly Ser Val 
120 125 130 

ggt gga tta tec atg gac teg get eca att ate eta gca gtg aag ate 1197 

Gly Gly Leu Ser Mec Asp Ser Ala Pro He He Leu Ala Val Lys He 

135 140 145 

teg gcc aag gcc gca gag get agg agt gtg gga gtg tgc ttt geg gac 1245 

Ser Ala Lys Ala Ala Glu Ala Arg Ser Val Gly Val Cys Phe Ala Asp 
150 .155 160 

gca agt gta egg gaa etc ggt gtt age gag ttt ctg gat aac gat ate 1293 

Ala Ser Val Arg Glu Leu Gly Val Ser Glu Phe Leu Asp Asn Asp He 
165 170 175 

tat tec aac ttt gag teg ctt att ate caa etc ggg gtg aag gag tgt 1341 

Tyr Ser Asn Phe Glu Ser Leu He He Gin Leu Gly Val Lys Glu Cys 
180 185 190 195 

ttg gtg cag atg gat get aat aag aag gat gtt gag ctg gga aag att 1389 

Leu Val Gin Met Asp Ala Asn Lys Lys Asp Val Glu Leu Gly Lys He 
200 205 210 

egg get att gcc gat agt tgt ggg ate get ate tec gag agg ccg gtg 1437 

Arg Ala He Ala Asp Ser Cys Gly He Ala He Ser Glu Arg Pro Val 

215 220 225 
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get gat ta:: ggt gzc aag gat att gag cag gat ctg acg agg ttg ttg 1485 

Ala Asp Tyr Gly Val Lys Asp He Glu Gin Asp Leu Thr Arg Leu Leu 
230 235 240 

agg gat caa egg zzq get ggt acg ctg ccg cac acg gag eta aag ctz 1533 

Arg Asp Glu Arq Ser Aia Gly Thr Leu Pro Gin Thr Glu Leu Lys Leu 

245 250 255 

gcg atg cgz teg ccg tct gcg ttg ate aag tac ctt ggg qtz atg acg 1581 

Ala Met Gly Ser ~la Ser Ala Leu He Lys Tyr Leu Gly Vai Met Thr 

260 265 270 275 

gat cct aca aac zzc ggc cag tae cag etc tat cag cat gat ttg teg 1629 

Asp Pro Thr Asn ?he Gly Gin Tyr Gin Leu Tyr Gin His Asp Leu Ser 
230 285 * 290 

cag ttt arg aag zzg gat teg teg gcg ctg cgt get ett aac ett atg 1677 

Gin Phe Mez Lys leu Asp Ser Ser Ala Leu Arg Ala Leu Asn Leu Met 
295 300 305 

eet ggt ccg egg gac gga teg aag tct atg agt ttg ttt ggt ttg ttg 1725 

Pro Gly Pre Arg Asp Gly Ser Lys Ser Met Ser Leu Phe Gly Leu Leu 
31C 315 320 

aat cac tgc aag ace eet gtt ggt age egg ttg ett gcg cag tgg ctg 1773 

Asn His Cys Lys Thr Pro Val Gly Ser Arg Leu Leu Ala Gin Trp Leu 

325 330 335 

aaa cag cec ttg azq gat ctg gcg gag ate gag aag aga cag cag ett 1821 

Lys Gin Pro Leu Met Asp Leu Ala Glu He Glu Lys Arg Gin Gin Leu 

340 345 350 355 

gtt gag gcg ttt grt gtt aac acg gag etc aga cag act atg cag gag 1869 

Val Glu Ala Phe Val Val Asn Thr Glu Leu Arg Gin Thr Met Gin Glu 
360 365 370 

gag eat ett cgc tee ata ccg gat ctg tat aga eta gcg aag egg tte 1917 

Glu His Leu Arg Ser He Pro Asp Leu Tyr Arg Leu Ala Lys Arg Phe 
375 380 385 

cag cgc aaa cag gca aac ztg gaa gac gtt gtg egg gtg tae cag gtt 1965 

Gin Arg Lys Gin Ala Asn Leu Glu Asp Val Val Arg Val Tyr Gin Val 
39C 395 400 

get att cqz ttg czz ggt "tt gtc aac tct etc qag aat gtt atg gat 2013 

Ala He Arc Leu Fro Gly Phe Val Asn Ser Leu Glu Asn Val Met Asp 

405 410 415 

gaa gag taz cag acg cec ctg gag acg gag tat act tec aac etc egg 2061 

Glu Glu Tyr Gin Thr Pro Leu Glu Thr Glu Tyr Thr Ser Asn Leu Arg 

420 425 430 435 

agt cac tec gat age tta gcg aaa ctg gag gag atg gtt gag act acg 2109 

Ser His Ser Asp Ser Leu Ala Lys Leu Glu Glu Met Val Glu Thr Thr 
440 445 450 

gtt gac ett gat gcg ctg gag aac cac gag tte ate ate aag eet gag 2157 

Val Asp Leu Asp Ala Leu Glu Asn His Glu Phe He He Lys Pro Glu 
455 460 465 
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ttt gac gag agt etc egg ate ate agg aag aag ctg gac aag etc cgt 2205 

Phe Asp Glu Ser Leu Arg lie lie Arg Lys Lys Leu Asp Lys Leu Arg 

470 475 480 

cat gat atg ggc czz gag cac cgc agg gta get egg gac ctt gac caa 2253 

His Asp Met Giy Vsi Glu His Arg Arg Val Ala Arg Asp Leu Asp Gin 
485 490 495 

gat act gag aag aac ztq tte ctg gag aae eac agg gtg cac gga tgg 2301 

Asp lie Glu Lys Lys Leu Phe Leu Glu Asn His Arg Val His Gly Trp 

500 505 510 515 

tgc tte ega etc ac> cgc aac gag teg gga tge ate cgc aat aag aga 2349 

Cys Phe Arg Leu Thr Arg Asn Glu Ser Gly Cys He Arg Asn Lys Arg 

52-: 525 530 

gag tac cag gaa cgt -ct aca cag aag aac ggt gtc tac tte act acg 2397 

Glu Tyr Gin Glu Cys Ser Thr Gin Lys Asn Gly Val Tyr Phe Thr Thr 

535 540 545 



teg act atg caa ace ztg cgc egg gag cat gat caa ctg tec teg aac 
Ser Thr Met Gin Thr Leu Arg Arg Glu His Asp Gin Leu Ser Ser Asn 
550 555 560 



att ggt gtt azc qcz zzc atg get cag acg ggc tgc ttt gtg cet tgt 
He Gly Val lie Aia leu Met Ala Gin Thr Gly Cys Phe Val Pro Cys 
680 685 690 



2445 



tac aat aga act cag ace ggc ctg gtg aat gag gtc gtt aac gtt gee 2493 

Tyr Asn Arg Thr Gin Thr Gly Leu Val Asn Glu Val Val Asn Val Ala 

565 570 575 

gcg tec tac tgt cet gtt ttg gaa cga ctt gcc ggt gtc ata gca cac 2541 

Ala Ser Tyr Cys Pro Val Leu Glu Arg Leu Ala Gly Val He Ala His 
580 585 590 595 

etc gat gtc att: gta age tte get cat get tet gtt cat gcg ecg ace 2589 

Leu Asp Val He Val Ser Phe Ala His Ala Ser Val His Ala Pro Thr 

600 605 610 

cec tat get egg ccc aag atg cac eca cga ggc ace gga aac aca gtt 2637 

Pro Tyr Ala Arg Pro Lys Met His Pro Arg Gly Thr Gly Asn Thr Val 

615 620 625 

etc aag gaa gcg cgc cac ccc tgt atg gaa atg cag gat gat att tea 2685 

Leu Lys Glu Ala Arg His Pro Cys Met Glu Met Gin Asp Asp He Ser 

630 635 640 

tte att act aat gat gtg get ttg gtc cga gac gag tec tec tte etc 2733 

Phe He Thr Asn Asp Val Ala Leu Val Arg Asp Glu Ser Ser Phe Leu 

645 650 655 

ate att act get cet aac atg gga ggt aaa teg act tat att cgc caa 2781 

He He Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr He Arg Gin 
660 665 670 675 



2829 



aca gaa gca gaa ttg acc ate ttt gac tgt ate ctt gca cgt gtt ggt 2877 
Thr Glu Ala Glu Leu Thr He Phe Asp Cys He Leu Ala Arg Val Gly 
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ori 700 705 

gca agt zaz zz^ zaq etc aag gga gtt tec act ttc atg get gag atg 2925 

Ala Ser Asp 5rr 31n Leu Lys Gly Val Ser Thr Phe Met Ala Glu Met 

"i: 715 720 

etc gaa aza zzz sac ate etc aag teg gca acg tec gag tct ett ate 2973 

Leu Glu Thr Ser Asn He Leu Lys Ser Ala Thr Ser Glu Ser Leu He 

725 730 735 

ate ate cac gag ctt ggg cgc ggt aca age acg tat gac gga ttc gge 3021 

He lie Asp. GI j Leu Gly Arg Gly Thr Ser Thr Tyr Asp Gly Phe Gly 

740 745 750 755 

eta gca tgg gcc ate tct gaa cac ate gte aca gag att cgt tgc ttc 3069 

Leu Ala Trp Ala He Ser Glu His He Val Thr Glu He Arg Cys Phe 
"60 765 770 

gge ctt t.c ccz act cac ttc eat gaa ttg aca get etc gee gat cga 3117 

Gly Leu Fhe Ala Thr His Phe His Glu Leu Thr Ala Leu Ala Asp Arg 

775 780 785 

tac ccc aag ccc g-:c aag aac ctg cac gta gtc gcc ttc ate gge gat 3165 

Tyr Pro Lys Ser Val Lys Asn Leu His Val Val Ala Phe He Gly Asp 

790 795 800 

ggt act gat gat gac agt gaa gat aag aag tec aag egg aac cag gte 3213 

Gly Thr Asp Asp Asp Ser Glu Asp Lys Lys Ser Lys Arg Asn Gin Val 

805 810 815 

act ctt ctg tac egg gtc gaa cct gge att tgt gac cag tea ttc ggt 3261 

Thr Leu Leu Tyr Arg Val Glu Pro Gly He Cys Asp Gin Ser Phe Gly 

820 825 830 835 

ate cac gtc gee gaa ttg gtc cgc ttc ccg gag aag gtg gtc aac atg 3309 

He His Val Ala Glu Leu Val Arg Phe Pro Glu Lys Val Val Asn Met 
840 845 850 

gcc cgc cag aag gca gag gaa ett gaa gac ttt act tea tec gaa cag 3357 

Ala Arg Gin Lys Ala Glu Glu Leu Glu Asp Phe Thr Ser Ser Glu Gin 

355 860 865 

caa gac cag cag zza tec atg gcg ate gat aaa tac teg cag gaa gaa 3405 

Gin Asp Gin Gin Ser Ser Met Ala He Asp Lys Tyr Ser Gin Glu Glu 

870 875 880 

gtt gag gag gge agt gcc ctt etc aaa gcg acg ctg ctg aaa tgg aag 3453 

Val Glu Giu Gly Ser Ala Leu Leu Lys Ala Met Leu Leu Lys Trp Lys 

885 890 895 

teg gag acc gag cCc tct ggt aag gag ttg aca gtg gaa gag aag cga 3501 

Ser Glu Thr Glu Ser Ser Gly Lys Glu Leu Thr Val Glu Glu Lys Arg 

900 905 910 915 

cag ate atg cgt gat etc gtc aaa gca gat gag aag ctg caa gca aac 3549 

Gin He Me^ Arg Asp Leu Val Lys Ala Asp Glu Lys Leu Gin Ala Asn 
520 925 930 

aag gtc ttc cag ggt ate aag get tta tag attagtattt gegtettttt 3599 
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Lys Val Phe Gin Gly lie Lys Ala Leu 
935 " 940 

tctttctcgg ggtcatagcg gttcggcgtt tggaaggtgt caatctgtgt atgtgtgatc 3659 

tacggacatg aggacaaaat gtgtagggaa taatattatc caaaaatttt cgagtgattg 3719 

ctcctttgga catatcccct aggattaaag tggaaaggga gaaatcccat tcaactatat 3779 

cgacataagt cacqzzcaqa tcgcgagtct agacgctcac cggg 3823 



<210> 2 
<211> 940 
<212> PRT 

<213> Aspegillus cryzae 
<400> 2 

Met Ser Ser Arg Pro Glu Leu Lys Val Asp Asp Glu Val Gly Phe lie 

1 5 10 15 

Arg Phe Tyr Arg Ser Leu Ala Ala Asn Ser Asn Asp Glu Thr lie Arg 

20 25 30 

Val Phe Asp Arg Gly Asp Trp Tyr Ser Ala His Gly Ala Lys Ala Glu 

35 40 45 

Phe He Ala Arg Thr Val Tyr Lys Thr Thr Ser He Leu Arg Asn Leu 

50 55 60 

Gly Arg Ser Asp Ser Gly Gly Leu Pro Ser Val Thr Met Ser Val Thr 
65 70 75 80 

Val Phe Arg Asn Phe Leu Arg Glu Ala Leu Phe Arg Leu Asn Lys Arg 

85 90 95 

He Glu He Trp Gly Ser Val Gly Thr Gly Lys Gly His Trp Lys Leu 

100 105 110 

Val Lys Gin Ala Ser Pro Gly Asn Leu Gin Asp Val Glu Glu Glu Leu 

115 120 125 

Gly Ser Val Gly Gly Leu Ser Met Asp Ser Ala Pro He He Leu Ala 

130 135 140 

Val Lys He Ser Ala Lys Ala Ala Glu Ala Arg Ser Val Gly Val Cys 
145 150 155 160 

Phe Ala Asp Ala Ser Val Arg Glu Leu Gly Val Ser Glu Phe Leu Asp 

165 170 175 

Asn Asp He Tyr Ser Asn Phe Glu Ser Leu He He Gin Leu Gly Val 

180 185 190 

Lys Glu Cys Leu Val Gin Met Asp Ala Asn Lys Lys Asp Val Glu Leu 

195 200 205 

Gly Lys He Arg Ala He Ala Asp Ser Cys Gly He Ala He Ser Glu 

210 215 220 

Arg Pro Val Ala Asp Tyr Gly Val Lys Asp He Glu Gin Asp Leu Thr 
225 230 235 240 

Arg Leu Leu Arg Asp Glu Arg Ser Ala Gly Thr Leu Pro Gin Thr Glu 

250 255 
Leu Lys Leu Ala Met Gly Ser Ala Ser Ala Leu He Lys Tyr Leu Gly 

260 265 270 

Val Met Thr Asp Pro Thr Asn Phe Gly Gin Tyr Gin Leu Tyr Gin His 

275 280 285 

Asp Leu Ser Gin Phe Met Lys Leu Asp Ser Ser Ala Leu Arg Ala Leu 

290 295 300 

Asn Leu Met Pro Gly Pro Arg Asp Gly Ser Lys Ser Met Ser Leu Phe 
305 310 315 320 

Gly Leu Leu Asn His Cys Lys Thr Pro Val Gly Ser Arg Leu Leu Ala 
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325 330 



Gin 


Trp 


Leu 


Lys. 


oln 


Pro 


Leu 


Met 


Asp 


Leu 








5-! I 










345 






Gin 


T ^ • . 

Leu 


Va 1 


Glu 


Ala 


Phe 


Val 


Val 


Asn 
















360 








Gin 


Giu 


G-U 


His 


Leu 


Arg 


Ser 


He 


Pro 




^ T ft 










O T C 

375 








Lys 


Arg 


rne 


Gin 


Arg 


Lys 


Gin 


Ala 


Asn 


Leu 


"5 Q 










"3 Q ri 










Tyr 


GXn 


V ai 




lie 


Arg 


Leu 


Pro 


Gly 


Phe 










405 










410 


va J. 


Met 


Asp 


•Giu 


Glu 


Tyr 


Gin 


Thr 


Pro 


Leu 








4 20 










425 




n — — . 

Asn 


T «i » 

Leu 


Arg 


Ser 


His 


Ser 


Asp 


Ser 


Leu 


Ala 
















440 






blU 


Thr 


Thr 


Val 


Asp 


Leu 


Asp 


Ala 


Leu 


Glu 




450 










455 








Lys 


Pro 


Glu 


Phs 


Asp 


Glu 


Ser 


Leu 


Arg 


He 


4 65 










470 










Lys 


Leu 


Arg 


His 


Asp 


Met 


Gly 


Val 


Glu 


His 










485 










490 


Leu 


Asp 


Gin 


Asp 


He 


Glu 


Lys 


Lys 


Leu 


Phe 








500 










505 




His 


Gly 


Trp 


Cys 


Phe 


Arg 


Leu 


Thr 


Arg 


Asn 






515 










520 






Asn 


Lys 


Arg 


Glu 


Tyr 


Gin 


Glu 


Cys 


Ser 


Thr 




530 










535 








rne 


Thr 


Thr 


Ser 


Thr 


Met 


Gin 


Thr 


Leu 


Arg 


545 










550 








Ser 


Ser 


Asn 


Tyr 


Asn 


Arg 


Thr 


Gin 


Thr 


Gly 










565 










570 


TV — _ 

Asn 


val 


Ala 


Ala 


Ser 


Tyr 


Cys 


Pro 


Val 


Leu 








580 










585 




lie 


Ala 


His 


Leu 


Asp 


Val 


He 


Val 


Ser 


Phe 






c n c 

595 










600 






Ala 


fro 


inr 


Pro 


Tyr 


Ala 


Arg 


Pro 


Lys 


Met 




CIA 

blU 










615 








Asn 


1 nr 


Val 


Leu 


Lys 


Glu 


Ala 


Arg 


His 


Pro 












630 








Asp 


He 


osr 


n 


He 


Thr 


Asn 


Asp 


Val 


Ala 










64 5 










650 


Ser 


Phe 


Leu 


- i5 


I le 


Thr 


Gly 


Pro 


Asn 


Met 








660 










665 




He 


Arg 


Go. n 


I le 


vjly 


Val 


I le 


Ala 


Leu 


Met 






67 5 










680 






Val 


Pro 


Gys 


Thr 


Glu 


Ala 


Glu 


Leu 


Thr 


He 




690 










695 








Arg Val 


Gly 


A.ia 


Ser 


Asp 


Ser 


Gin 


Leu 


Lys 


705 










710 








Ala 


Glu 


wet 


T All 

Leu 


Glu 


Thr 


Ser 


Asn 


He 


Leu 










725 










730 


Ser 


Leu 


He 


He 


He 


Asp 


Glu 


Leu 


Gly 


Arg 








740 










745 




Gly 


Phe 


Gly 


Leu 


Ala 


Trp 


Ala 


He 


Ser 


Glu 






755 










760 






Arg 


Cys 


Phe 


Gly 


Leu 


Phe 


Ala 


Thr 


His 


Phe 




770 










775 








Ala 


Asp 


Arg 


Tyr 


Pro 


Lys 


Ser 


Val 


Lys 


Asn 


785 










790 
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335 



Ala 


Glu 


He 


Glu 


Lys 


Arg 








350 






Thr 


Glu 


Leu 


Arg 


Gin 


Thr 






365 








Asp 


Leu 


Tyr 


Arg 


Leu 


Ala 




380 










Glu 


Asp 


Val 


Val 


Arg 


Val 


395 










400 


vai 


7\ f r\ 

Asn 


Ser 


Leu 


Glu 


Asn 










415 




Glu 


Tnr 


Glu 


Tyr 


Thr 


Ser 








430 






Lys 


Leu 


Glu 


Glu 


Met 


Val 






445 








Asn 


His 


Glu 


Phe 


He 


He 




4 60 










He 


Arg 


Lys 


Lys 


Leu 


Asp 


475 










480 


Arg 


Arg 


Val 


Ala 


Arg 


Asp 










495 




Leu 


Glu 


Asn 


His 


Arg 


Val 








510 






Glu 


Ser 


Gly 


Cys 


He 


Arg 






525 








Gin 


Lys 


Asn 


Gly 


Val 


Tyr 




540 










Arg 


Glu 


His 


Asp 


Gin 


Leu 


555 










560 


Leu 


Val 


Asn 


Glu 


Val 


Val 










575 




Glu 


Arg 


Leu 


Ala 


Gly 


Val 








590 






Ala 


His 


Ala 


Ser 


Val 


His 






605 








His 


Pro 


Arg 


Gly 


Thr 


Gly 




620 










Cys 


Met 


Glu 


Met 


Gin 


Asp 


635 










640 


Leu 


Val 


Arg 


Asp 


Glu 


Ser 










655 




Gly 


Gly 


Lys 


Ser 


Thr 


Tyr 








670 






Ala 


Gin 


Thr 


Gly 


Cys 


Phe 






685 








Phe 


Asp 


Cys 


He 


Leu 


Ala 




700 










Gly 


Val 


Ser 


Thr 


Phe 


Met 


715 










720 


Lys 


Ser 


Ala 


Thr 


Ser 


Glu 










735 




Gly 


Thr 


Ser 


Thr 


Tyr 


Asp 








750 






His 


He 


Val 


Thr 


Glu 


He 






765 








His 


Glu 


Leu 


Thr 


Ala 


Leu 




780 










Leu 


His 


Val 


Val 


Ala 


Phe 


795 










800 



wo 00/50567 



8 



PCT/DKOO/00063 



T 1 o 


v3 A y 


Asp Gly Thr 


Asp Asp 


Asp 


Ser 


Glu 


Asp 


Lys 


Lys 


Ser Lys Arg 










o ^ n 










O 1 A 
O iU 










815 




Asn 


ox n 


Val 


Thr 

Q T 
o z U 


Leu 


Leu 


Tyr 


Arg 


val 
825 


(jlU 


Pro 


Caiy 


T 1 ^ 

i xe 


Cys 
830 


Asp 


Gin 


r 

oc; X 




Gly 


lie 


His 


Val 


Ala 


Ulu 
840 


Leu 


vax 


Arg 


rne 


Pro 
845 


Glu 


Lys 


Val 


Va 1 
V o ^ 


As n 
850 


Met 


Ala 




Gin 


Lys 

O C El 

855 


Mxa 


Pill 

uXU 


Pill 

uXU 


Leu 


uXU 

860 


Asp 


Phe 


Thr 


Ser 


OC: I. 


O J. u 


Gin 


Gin 


Asp 


Gin 


Gin 




ser 


rie L 


a 1 a 

rtxa 


JL xe 


Asp 


Lys 


Tyr 


Ser 


865 










O "1 A 










875 










880 






Glu 


Vs i 


S 5 5 


Glu 


Gly 


Ser 


Axa 


Leu 
890 


Leu 


Lys 


Ala 


Met 


Leu 
895 


Leu 


Lys 


Trp 


Lys 


Ser 
500 


Glu 


Thr 


Glu 


Ser 


Ser 

905 


Gly 


Lys 


Glu 


Leu 


Thr 
910 


Val 


Glu 


Glu 


Lys 


Arg 
915 


Gin 


lie 


Met 


Arg 


Asp 
920 


Leu 


Val 


Lys 


Ala 


Asp 
925 


Glu 


Lys 


Leu 


Gin 


Ala 
930 


Asn 


Lys 


Val 


Phe 


Gin 
935 


Gly 


lie 


Lys 


Ala 


Leu 
940 











<210> 3 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # U 
<400> 3 

gggaagctgc caggccccag tgtcagcctc ctatgctc 38 



<210> 4 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # L-G.T 
<400> 4 

gagcatagga ggctgacatt ggggcctggc agcttccc 38 



<210> 5 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 
L-G. A. 

<400> 5 

gagcatagga ggctgacaat ggggcctggc agcttccc 38 



<210> 6 
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<2i:> 38 

<212> DNA 

<213> Artificial Sequence 

<22:> 

<223> Oescripzicr. of Artificial Sequence: oligo # 
L-G.G. 

<40C> 6 

gagcatagga ggc-gacagt ggggcctggc agcttccc 38 

<21C> 7 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<22C> 

<223> Descrip^icr. of Artificial Sequence: oligo # 
L-A.C. 

<40C> 1 

gagcatagga ggc::gac3cc ggggcctggc agcttccc 38 

<210> 8 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # L-TG 
<400> 8 

gagcatagga ggctgacact gtggggcctg gcagcttccc 4 0 

<21C> 9 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # L-HOM 
<400> 9 

gagcatagga ggccgacacc ggggcctggc agcttccc 38 

<210> 10 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Descripticr. of Artificial Sequence: oligo # PR 
117858 



<400> 10 



wo 00/50567 jQ 

ggcncarath ggntgyttyg tncc 

<210> 11 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
117859 

<400> 11 

gcccangcna rnccraancc 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
<400> 12 

gctcgaaaca tccaacatcc 

<210> 13 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
<400> 13 

gctgtgaatc acttgcacc 

<210> 14 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
<400> 14 

cttcataaac tgcgacaaat catgc 

<210> 15 

<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 



PCT/DKOO/00063 

24 

oligo # Pr 

20 

oligo # 130740 

20 

oligo # 130741 

19 

oligo # 131928 

25 

oligo # 131929 
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<400> 15 

ggaggagcat cmcgc 



16 



<210> 16 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 131930 
<400> 16 

ggaactcgaa gac-rtactt catcc 25 

<210> 17 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 134608 



<210> 18 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 
134609 

<400> 18 

gtgctttgcg gacgc 15 

<210> 19 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 134610 



<400> 17 

ccagaaactc gctaacacc 



19 



<400> 19 

caggacagta ggacgc 



16 



<210> 20 
<211> 17 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Descripuisn of Artificial 
<400> 20 

cgagcgatga actc-gc 

<210> 21 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 

<400> 21 

gcgttggtgg attaicc 

<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 22 

cgttgcatct atcanetacc 

<210> 23 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
136106 

<400> 23 

ggtatatgai agatccaacg c 

<210> 24 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
137208 

<400> 24 

ccgcgtctcc aacaagatga atgg 



Sequence: oligo # 135320 

17 

Sequence: oligo # 135321 

17 

Sequence: oligo # 136105 

20 

Sequence: oligo # 

21 

Sequence: oligo # 



<210> 25 
<211> 21 
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<212> DNA 

<215> Artificial Sequence 
<220> 

<223> Descriptirr. of Artificial Sequence: oligo # 137207 
<400> 25 

ccgctttctc gggg-catag c 21 



<210> 26 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<22C> 

<223> Descripc ic.-. of Arcificial Sequence: primer 138149 



<210> 27 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 
000120j2 

<400> 27 

tctgcgaatc gcttgcatcc cgaacgcgac aacac 35 

<21C> 28 
<21i> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; oligo # 
0OO120j4 



<210> 29 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description cf Artificial Sequence: oligo # 
000120j3 

<400> 29 

gttgtcgcgt ucggga-cca agcgattcgc agaag 35 



<400> 26 

cctttccact -taatcc'iaa gc 



22 



<400> 28 

gagctcagat ctcttaggtt ctggacgaga aga 



33 
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<210> 30 
<2il> 33 
<212> DNA 

<213> Arcificiil Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 

12 98-ThKA 



<400> 30 

gcaagcgcgc gcaatacatg gtgttttgat cat 



33 
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